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ABSTRACT. Minimum distance is an important parameter of a linear error correcting code. For improved performance of binary 
Low Density Parity Check (LDPC) codes, we need to have the minimum distance grow fast with n, the codelength. However, the 
best we can hope for is a linear growth in ci m j n with n. For binary LDPC codes, the necessary and sufficient conditions on the 
LDPC ensemble parameters, to ensure linear growth of minimum distance is well established. In the case of non-binary LDPC 
codes, the structure of logarithmic weight codewords is different from that of binary codes. We have carried out a preliminary study 
on the logarithmic bound on the the minimum distance of non-binary LDPC code ensembles. In particular, we have investigated 
certain configurations which would lead to low weight codewords. A set of simulations are performed to identify some of these 
configurations. Finally, we have provided a bound on the logarithmic minimum distance of nonbinary codes, using a strategy 
similar to the girth bound for binary codes. This bound has the same asymptotic behaviour as that of binary codes. 
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1. Introduction 

In this report, we present some findings of a preliminary investigation on the minimum distance growth of non binary LDPC codes. 
We present some aspects of the necessary conditions for the growth of minimum distance for non binary LDPC codes. 

2. Low-Density Parity-Check codes 

Low-Density Parity-Check codes (LDPC) are class of linear error correcting codes originally proposed by Gallager in the 1960s fll . 
LDPC codes are among the capacity achieving codes, with superior performance under iterative decoding algorithms which are easy to 
implement with affordable complexity [ 2 1 . 

2.1. Binary LDPC Codes: Construction and parameters. A linear binary code C of length n is a linear subspace of F£ . If C has 
dimension k, then 6 is referred as C [n, k] code [ 3 1 . The code C forms a linear map from all possible binary k tuples to a k dimensional 
vector space F£ over Fa, A codeword c £ C is an element in the vector space Fjf . The linear mapping is represented by a generator 
matrix G £ Fj xk ■ Being a linear subspace of dimension k, the code C can also be described as the kernel of a matrix H £ jp( n - fc ) xn ^ 
so that C = {c £ FJ \Hc = 0} (We treat codewords c as column vectors for this description). The matrix H £ j^ n-fe ) xn j s ca lled 
parity check matrix. The generator and parity check matrices are related by HG = 0. 

LDPC codes belong to the class of linear codes. A LDPC code is defined as follows (4). 

Definition 1. A low density parity check code is a linear block code which has a sparse parity check matrix. 

Here, sparse refers to the condition of having at most a fixed constant number of l's (the constant is independent of n) 0. The name 
low density in LDPC is attributed to this sparsity feature of the parity check matrix. 

The number of ones in a binary vector is referred as its weight. For a matrix, row (column) weight denote the weights of the 
corresponding row(column) vector. A LDPC code is regular, if all the column weights are the same (say /) and all the row weights (say 
r) are the same, for the parity check matrix H £ ]p™ xn (Note that, the number of parity equations are usually, denoted as m instead 
of (n — k) as described for linear codes in general. We adopted this commonly used notation in LDPC literature [2|). This represent a 
regular (n, I, r) — LDPC code. The relationship In — rm holds. The parameters l(r) are also known as variable(check) node degrees 
for (n,l,r) - LDPC code. 

The design rate of a (n, l,r) — LDPC is defined R = 1 — I jr. A toy example, of a regular parity matrix with n = 10, I = 3 and 
r — 6 is represented below. 

"1 11001100 1" 
10 10 110 110 
0011101011 

10 1110 10 1 

1 10100111 0_ 

When the column (row) weights are not identical across columns (rows), such matrices correspond to what is known as left (right) 
irregular LDPC codes. We can describe them in terms of the column (row) weight distributions. An easier and more convenient 
representation using a graphical method is widely used, which we discuss next. 

2.2. Tanner graphs. Tanner introduced in [6 1 a convenient graphical representation of LDPC codes in terms of a bipartite graph. Such 
a representation also known as Tanner graph, is often useful in the encoding , decoding, as well as in the analysis of LDPC codes [2|. 
Bipartite representation and parity matrix representation are essentially synonymous. A bipartite representation of a simple parity check 
matrix Eq.([T} is illustrated in Figure[TJ 

0" 

1 

1 

1 1 



1 1_ 

A bipartite graph consists of a set of variable nodes and a set of check nodes, together with edges connecting pair of nodes, of 
different type (An edge neither connects a variable node to variable node, nor from a check node to check node). The variable nodes 
(shown as circular nodes in Figure[T) represent the elements (bits) of codeword (xi , x%, . . . , x n ) and check nodes (shown as rectangular 
nodes in FigureQ} represent m parity equations. An edge a t j is connected from a variable node Xi to a check node Cj if the element 
Hj,i is 1 (nonzero). The number of edges connected to a variable (check) node is referred to as the degree of the corresponding variable 
(check) node. For regular (n, I, r) — LDPC, this result in l(r) neighbours for all the variable (check) nodes. The column (row) 
indices of H maps to the variable (check) nodes in bipartite graph. Thus, the weight of a column (row) simply equals to the degree of 
corresponding variable (check) node in bipartite graph. 
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Figure 1. Bipartite representation a code: The circular nodes are called variable nodes and square boxes referred 
as check nodes representing parity conditions. The number of variable nodes equal the code length and the number 
of check nodes equal to the number of parity equations. In this illustration, the codelength n — 8 and the number of 
parity conditions m = 6. The graph maps to the parity check matrix H in Eq.QJ where the row index correspond 
to the check nodes and the column indices to variable nodes. This is an example of irregular code. 



An irregular LDPC code has a sparse parity check matrix in which the column (row) weight may vary from column (row) to column 
(row). In such cases, it is useful to talk about the distribution of the weights on column(row) of the H matrix. The bipartite graph 
illustration in Figure[T]is an example of irregular code. 

2.3. Degree distribution pairs. We have seen that, the weight of a column (row) of H is the same as the degree of corresponding 
variable (check) node in the bipartite graph. For a general bipartite graph we could then define the degree distribution, which equivalently 
translate to the non zero elements of the representing parity check matrix. Let Li(Rj) denote the fraction of variable (check) nodes 
with exactly i edges connected to them. Let 2 m ax(r ma x) denote the maximum number of edges connected to any variable (check) node. 
The polynomial L(x) = ^. maK LiX 1 is the variable node degree distribution and R(x) = ^ rmax Rjx 3 is the check node degree 
distribution. These are usually described as pair (L, R) and is referred to as degree distribution pair from node perspective. There is an 
equivalent definition of degree distribution from edge perspective, denoted as (A, p). The polynomials are {\(x) = ]T\ Aja;* , p(x) — 
^2- pjX 3 ~ ) where Ai is the fraction of edges which are connected to variable nodes of degree i. Similarly, pj denote the fraction of 
edges which are connected to check nodes of degree j. For a (n, I, r) — LDPC-regular code we have: \(x) — x l ~ x and p(x) — x r ~ Y 
or equivalently, L(x) — x , R(x) = x r 

2.4. Ensemble of binary LDPC codes. It is of interest to study the ensemble properties of LDPC codes, rather than that of an isolated 
code. For large n, the performance of any code is found to be close to that of an ensemble |2l . 

Given a pair (A, p) of degree distributions and the block length n, an ensemble of bipartite graphs G(A, p) is defined by the collection 
of graphs by running over all possible permutations of edges connecting variable nodes and check nodes, subject to the given degree 
distribution pair (A, p). 

In what follows, when we discuss properties and performance of LDPC codes, we always refers to that of LDPC ensemble. 

3. Minimum distance of LDPC Codes 

Minimum distance is an important design parameter for a linear code 1 3 1. The definition of minimum distance for LDPC is the same 
as that of any linear code. 

Definition 2. Minimum distance is the smallest Hamming distance between any two codewords of the code. 

More precisely, it is the smallest (among all codeword pairs) number of difference in bit values at individual positions of any two 
codewords. Minimum distance is denoted by d m m. 

3.1. Necessary condition to have linear growth in minimum distance. The behaviour of the minimum distance of a LDPC code 
ensemble as the codelength n increases is referred as minimum distance growth. For improved decoding performance, we wish to have 
dmin grow fast with n, the codelength 1 7 1 1 8 1 . However, the best we can achieve is a linear growth of d m i n with n. We want to construct 
LDPC ensemble with linear minimum distance growth. 

In order to achieve this growth, we have to admit necessary and sufficient conditions on ensemble parameters, more precisely 
conditions on degree distribution pairs (A, p). If (A, p) satisfies the necessary conditions, we could achieve minimum distance growth 
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better than sub-linear (more precisely speaking, logarithmic). However, necessary condition does not guarantee linear minimum distance 
growth. To ensure minimum distance growth faster than logarithmic, we must avoid codewords of logarithmic minimum distance. 




Figure 2. Minimum distance growth growth: The yaxis correspond to the growth in minimum distance, whereas, 
the xaxis show the codelength. The region between the two curves correspond to minimum distance growth faster 
than logarithmic and slower than linear. The bottom curve shows a logarithmic growth in minimum distance 
whereas, the top curve correspond to the linear minimum distance growth. 



For binary codes, the necessary condition translates to the following: 

If the number of degree 2 variable nodes (712) is equal to the number of check nodes m, each cycle correspond to a 
valid codeword.. When n<i < m, the minimum distance growth is at most logarithmic in n. 

4. Non-binary LDPC Codes 

A non-binary LDPC code can be defined analogous to the binary LDPC code. These are linear codes defined as vector space ¥ q over 
F 9 , where q is a prime power, i.e., (q — p a ) with p prime and a £ N a positive number. We will restrict attention only to cases where q 
is of the form 2 a ,i.e., that is codes defined over extension fields of binary fields. A non-binary code denoted C[n, k] q can be described 
as the kernel of the parity check matrix H £ F 9 n ~ fc)xn , such that 6 = {c £ F"|Hc = 0}. 

Definition 3. A low density parity check code is a linear block code, which has a sparse parity check matrix. The sparse elements are 
elements from ¥ q . The code is defined over a finite field ¥ q . 

Similar to the binary code, a non binary code can be described using the Tanner graph representation of the corresponding parity 
check matrix. A non binary LDPC code defined over ¥ q with parity check matrix H £ jp"( n - fc ) xn jj as a q_ rv bipartite graph repre- 
sentation. The edge labels, variable nodes (q— ry codeword G F 9 ) in g— ry bipartite graph assume nonzero values from F 9 . The linear 
operations (parity check equations) are as well carried out in the usual finite field algebra in ¥ q . We provide an example of a 4— ry 
bipartite graph representation (See Figure[3j of a parity check matrix defined over F4 shown below (Eq|2j. 
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In the example [2|, the primitive polynomial generating F4 is p(x) = 1 + x + x 2 . The primitive root is a and the elements of F4 are 

{0, l,a, 1 + a}. 

4.1. Code Ensemble. The code ensemble of non-binary LDPC codes is defined in analogous way |2| as linear LDPC codes, with the 
exception that, the non-zero entries of the parity check matrix H are chosen uniformly at random from F*, where F* = ¥ q \ {0}. In 
words, the edge labels are chosen uniformly from the non-zero elements of ¥ q , subject to the degree distribution pair (A, p). 

4.2. Minimum distance of non-binary LDPC codes. For codes defined over F 9 with q — 2 a , there are two possible definitions of 
minimum distance for non-binary LDPC codes (linear codes in general). One of them is the Hamming minimum distance, denoted by 
dflmin- The Hamming minimum distance d_H m i n is simply the minimum (among among all codewords) number of difference in symbol 
positions between any two codewords in C. The second definition of minimum distance is in terms of the binary image representation 
of the q— ry codeword. We adopt this latter definition of minimum distance for our investigation. 
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Figure 3. Bipartite graph of a non binary code with parity check matrix H code defined over F4. The 
primitive polynomial generating F4 is p(x) = 1 + x + x 2 . The primitive root is a and the elements of 
F4 are {0, 1, a, 1 + a}. The different edge labels are marked in different color (and thickness). Variable 
nodes are represented by circles and check nodes by square boxes. 



Definition 4. Minimum distance of non-binary g— LDPC code is the smallest Hamming distance between the binary image representa- 
tion of any two codewords. 

5. Configurations for codewords with logarithmic weight 

We look at the necessary conditions for the linear minimum distance growth of non-binary LDPC code ensembles. Our approach 
towards this direction is to study configurations which correspond to low weight codewords. More precisely, we focus on codewords 
which have logarithmic weight. 

Consider a Tanner graph of binary LDPC code. We consider the sub graph induced by degree-2 variable nodes. Any cycle in the sub 
graph would correspond to a valid codeword. If length of the cycle is logarithmic, then it leads to logarithmic weight. It can be shown 
that when the number of degree 2 variable nodes in the code (712 is higher than the number of check nodes (m), there exists cycles of 
logarithmic weight (A (0)p (1) < 2)). So the necessary condition for better than logarithmic minimum distance growth is to ensure 
that, 712 < m. 

The structure of logarithmic weight codewords in non-binary case is however different from that of binary case. We have considered 
specific configurations involving cycles, union of cycles and chain of cycles. In the following section, we look at the structure of the 
sub-graphs induced by these configurations. 

6. Consistency of structured equations in ¥ q 

Here, we focus on the structure of the sub-matrices (in the parity check matrix) corresponding to the sub-graphs of the configurations 
of interest. The system of linear equations involving these structured matrices will then provide some clues on the behaviour of codes 
satisfying the parity conditions. 

First let us consider a linear system of equations in F 9 (A general linear system of equations in finite field is presented in 1101 
and special structures of equations in F2 is addressed in 1111 ). We are specially interested in a system of the form Ax = 0, where, 
A £ F^ c xT " and x € F^" . We associate the matrix A to a bipartite graph G with T v variable nodes and T c check nodes. 

For a linear system of equations Ax = 0, the relationship between the rank criteria and the number of possible solutions in F 9 is 
summarized in the following lemma. 

Lemma 5. If there are T v unknowns and T c equations (T v > T c . i.e., we have equal or more unknowns than equations), the number of 
solutions of x £ F q satisfying the linear system of equations Ax = where A G F^ c xT " is equal to q Tv ~ rank ( A \ 

If the matrix A is square (T c = T v ), then, there are q T c~ rank ( A ) solutions. When A is full rank (rank(A) = T c ), we have a unique 
solution and this unique solution is the all zero vector T " x 1 . 

Suppose we focus our interest to system of equations with special structure. More specifically, we look at systems with more 
unknowns than equations (T v > T c ). Let t = T v — T c + 1, We consider matrix A to have full rank rank(A) = T c . We also consider 
the system (of equations and solutions) constrained such that, each elements of them are non zero. Stated differently, this means that the 
elements of the x are all from F*, where F* = F g \ {0}. The number of such solutions of Ax = then is equal to (q — l)*~ . 
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6.1. Structure of matrices corresponding to cycles of associate Tanner graphs. . 

For bipartite graphs with cycles, there is an interesting structure in the corresponding parity check matrix. The underlying structure 
of the matrix corresponding to cycle in Tanner graph can be summarized in the following way 1121 . 

Definition 6. A 2g cycle matrix M is a g x g matrix over F g satisfying the following conditions: 

(1) There are exactly two non zero elements (S F 9 ) in each row of M 

(2) There are exactly two non zero elements (g F 9 ) in each column of M 

(3) For any square sub-matrix N C M, N does not satisfy the previous two conditions simultaneously. 
Example 7. A 4— cycle, along with the associated matrix representation is shown in Figure|4] The code is defined in F4. 
Example 8. A 6— cycle matrix representation defined over F4 is given by M3. 

Example 9. A bipartite graph and cycle representation of a 8— cycle matrix is shown in Figure[5] The code is defined over F4. 
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Figure 4. Single cycle: The smallest cycle (4— cycle) and its matrix representation are shown. Each row 
and column has exactly 2 non zero elements. The matrix is defined over F4. The primitive polynomial 
generating F4 is p(x) = 1 + x + x 2 . The primitive root is a and the elements of F4 are {0, 1, a, 1 + a}. 
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Vl + < 




Figure 5. A 8— cycle graph and the corresponding matrix representation are shown. All the columns 
and rows of this square matrix have exactly 2 non zero elements. The matrix is defined over F4. The 
primitive polynomial generating F4 is p(x) = 1 + x + x 2 . The primitive root is a and the elements of 

F 4 are {0,1, a, 1 + a}. 

It may be noted that, for sub-graphs which are cycles, the corresponding sub-matrix of the parity check matrix is square. We will 
soon see (in the following sections) that, for other structures such as joint cycles, the associated sub-matrices no longer stay square. 
In the case of single cycles, Poulliat (2006) et al have recently established the following result |9|. 
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Theorem 10. If eta, Q2, , ot-i are independent random variables which take the values 1, 2, . . . , q — 1 independently and with 

equal probabilities, then Pc, the probability that there exist solutions other than the non trivial {all zero) is given by, 

(3) Pc = — i-r. 

9 - 1 

6.2. Matrix structure of associated graphs with joint cycles. When the sub-matrix corresponding to the sub-graph (of a Tanner 
graph) has the following structure, that correspond to union of two joint cycles. 

(1) Every column has exactly two non zero elements. The non zero elements are the non zero elements of the field ¥ q over which 
the code is defined. 

(2) Exactly two rows have 3 non zero elements whereas the remaining rows have exactly 2 non zero elements. The non zero 
elements are the non zero elements of the field ¥ q over which the code is defined. 

Example 11. A matrix representation of a configuration with union of two joint cycles is shown in Figure|6] along with the bipartite 
graph and cycle representations. 




C5 

Figure 6. Joint cycles: All the columns of the matrix has 2 non zero elements. Two rows have 3 non 
zero elements, while the remaining rows all have 2 non zero elements. The matrix corresponding to 
this system is rectangular. The matrix is defined over F4. The primitive polynomial generating F4 is 
p(x) = 1 + x + x 2 . The primitive root is a and the elements of F4 are {0, 1, a, 1 + a}. 

In the case of joint cycles, the matrix assumes a rectangular shape (as against a square matrix representation of single cycle). The 
number of columns for such matrices are higher than the number of rows. In other words, the number of equations are less than the 
number of variables. 

Matrix representation correspond to union of more than one cycles is just an extension of the 2 cycle union. The matrix stay 
rectangular as it was the case for union of two cycles. 

6.3. Union of joint cycles. As discussed earlier, the union of joint cycles correspond to a linear system of equations, with certain 
structure. The linear system Ax = with A rectangular. Since the number of variables are more than the number of equations, there 
would exist multiple solutions. 

This is a linear system of equations of the form Ax — 0, where x is the solution to the kernel (null space) of A. The number of 
nonzero solutions of x satisfying Ax = is equal to (q — i) T - mnk ( A ) j n me case Q f t wo joint cycles in a graph, the rank(A) = 
T — 1, which lead to a total of (g — 1) solutions. In general, for t joint cycles, rank(A) — T — t + 1 and consequently, a total of 
(q — l) T_ ( T_ * +1 ) = (g — non trivial (non zero) solutions. 

A special case of joint cycles is chains of cycles. 
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Figure 7. Chain of cycles: On the left is the bipartite graph and on the right shown is the cycle view of 
the graph. Both represent the same graph. 



6.4. Chain of cycles. In the case of chain of cycles, two or more cycles are connected by a link involving series of check nodes and 
variable nodes. The connecting link will always have one variable node more than the number of check nodes. An example of chain of 
cycles is shown in Figure[7] 

Let us consider bipartite graph with variable node degree equal to 2. We partition the graph into two sub-graphs Si and S2. The set 
Si contains disjoint cycles, whereas 62 consists of chains. Let 77,1 be the number of variable nodes in Si and 7712 is the number of check 
nodes. In S2 we have chains. Several combinations are possible here (They are yet to be listed here). Let 71,2 be the number of variable 
nodes in S2 and let 7712 denote the number of check nodes in this partition. 

Clearly ni = mi. There are different configurations possible for chains. The worst case scenario would be 7712 = f^f-]. The best 
case (least number of check nodes in S2 would be 1. In the latter configuration, all the variable nodes forming the chain are connected 
to check nodes from Si . 

Let a be the average number of degree 2 variable nodes in a chain. Let C denote the number of chains. We are interested to find a 
relationship between 772, C and a. Let be the number of degree-2 variable nodes in chain i. The number of check nodes in chain i is 
then equal to at — 1. The total number of variable nodes in chain is 712. Similarlythe total number of check nodes in chain is nz. That 

|C| |C| 

is 7i2 = a,i and 7712 = a.i — 1 

i=l i = l 



Total number of variable nodes counted in all chains in graph 
Total number of chains in the graph 

n 2 
\G\ 

, 772 _ 7772 

M — — — 7 

a a — 1 

a(m 2 ) = (n 2 )(a-l) 
a 

T12 = 7712 ~ 

a — 1 



When C — 1, it is a special case (unique cycle). 
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7. Simulation 

In order to verify the existence of the (valid codeword) configurations discussed thus far, we have performed some simulations using 
non binary LDPC codes. The simulation setup consisted of the following. LDPC codes defined over F4 is used. The decoding algorithm 
chosen is belief propagation (BP), under Additive white Gaussian noise (AWGN) channel. Both (2, 3) — LDPC and (2, 4) — LDPC 
schemes are considered. We performed simulations for three different codelengths (n = 900, 1800 and 9000). Channel parameters are 
chosen such that, we are in error floor region (of the bit error performance). We considered error events less than 50. We have considered 
structure of codeword configurations corresponding to error events. Random permutations and recursive strategy to avoid loops with 
multiple edges are used. With limited number of simulations, we have discovered some codewords with cycle configurations. We 
have also identified certain codeword configurations with few other interesting structures. So far, we have not identified any codeword 
configurations with union of joint cycles, but we expect the need for longer simulation runs to identify more interesting structures. In 
(5), they have identified configurations with union of 3 joint cycles. However they have not reported any cases of union of 2-joint cycles. 
We are performing more extensive simulations to identify some of these configurations. 




Figure 8. Simulation result: Configuration with one cycle: 



8. Discussion on bounds 

In the case of binary LDPC codes, the minimum distance of a codeword length correspond to the smallest cycle in the graph 
representation of the code. The length of the smallest cycle in a graph is referred to as the girth| 13 1. Since a cycle of length t give 
rise to a codeword of weight t (just set the variable nodes involved in the cycle to 1 and all remaining variables to (8J). the girth of 
Tanner graph then directly infer the minimum distance. If the length of such cycle is logarithmic, then that lead to logarithmic minimum 
distance. In other words, logarithmic bounds on girth then imply logarithmic bound on minimum distance. 

For non-binary codes, the result is not immediate. For LDPC codes over ¥ q ,q > 2, cycles correspond to codewords with some 
probability. This probability decreases with increase in alphabet size (q). However, because of the higher degrees of freedom available 
in a non binary code, such codewords can almost always be avoided by appropriate choice of edge labels (In binary case, there is no 
degree of freedom available since 1 is the only non-zero value). Even in the case of 2— regular codes, cycles which correspond to valid 
codewords, can be avoided in non-binary case. In binary codes, a codeword with a cycle configuration cannot be avoided. 

On the other hand, codeword configurations which are joint cycles cannot be avoided. This in a way serve as the deciding parameter 
on the minimum distance of non binary codes. We can obtain a bound on logarithmic d m i n using similar approach as that of binary 
codes. 

8.1. Girth bound adapted to nonbinary codes. For binary codes, the bound on logarithmic d m in can be be proved as follows. We 
know that, a cycle of length t give rise to a codeword of weight t (simple assignment of 1 to all the variable nodes in the cycle and 
to every other variable nodes). Consider a tree structure of the graph (a tree stemmed from a check node), where we assume that, there 
are no cycles up to (maximum) depth td- Then, there is a cycle at depth td + 1. In other words, the maximum depth of the tree graph is 
td- The tree depth td is logarithmic in the codelength n if the number of degree-2 variable nodes (712) is more than the number of check 
nodes (m) fUl fl5j. 
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We can adapt the bound strategy to nonbinary codes, with a simple exception in the graph. Here, we begin with a cycle. We construct 
a tree-like graph stemmed from the check node of the cycle (Remember that the graph constructed thus far is not a tree-graph in the 
strict sense. What we constructed is essentially a tree structure which stemmed down from a check node of a cycle). Recall that, a 
codeword with a corresponding cycle configuration can be avoided in the non-binary case. Thus, a single cycle in the graph considered 
is not corresponding to a codeword. Let td be the depth of the graph, until when we have no further cycles. Then, at depth td + 1, we 
have another cycle and this new cycle form a joint cycle with the earlier cycle (from which the graph stemmed). The union of two joint 
cycle indeed form a valid codeword and this configuration cannot be avoided. By same argument as the girth bound for binary codes, 
we can arrive at the bound on logarithmic d m i n as td ~ 0(log n) if 712 > m + 1. Note that the only difference between this bound and 
that of the binary code (girth bound) is the additional term 1, which is due to the fact that, there is already a cycle in the graph within 
graph of depth td- Asymptotically (as n — > 00), the two bounds (binary and nonbinary) approach the same bound. 

9. Concluding remarks and scope for further work 

In this report, we have discussed the results of our preliminary study on the necessary conditions for the linear minimum distance 
growth of non-binary LDPC code ensembles. We have studied some specific configurations which lead to low weight codewords. Our 
simulations have helped to identify some of these configurations. It is hoped that, with extensive simulations, more such configurations 
could be identified. While girth of the Tanner graph directly provide a measure of minimum distance for binary codes, adapting the 
girth bound to joint cycle configuration help us to obtain a bound for logarithmic minimum distance, in the case of nonbinary codes. 
Asymptotically, these two bounds indeed behave similar. Because of the increased number of freedom available in nonbinary codes (to 
choose edge labelling), it is perhaps possible to achieve improvements on these bounds with appropriate edge labelling. 
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